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abstract 

Recently there has been remarkable progress in the complex Langevin method, which 
aims at solving the complex action problem by complexifying the dynamical variables in 
the original path integral. In particular, a new technique called the gauge cooling was in¬ 
troduced and the full QCD simulation at finite density has been made possible in the high 
temperature (deconhned) phase or with heavy quarks. Here we provide a rigorous justi¬ 
fication of the complex Langevin method including the gauge cooling procedure. We first 
show that the gauge cooling can be formulated as an extra term in the complex Langevin 
equation involving a gauge transformation parameter, which is chosen appropriately as a 
function of the configuration before cooling. The probability distribution of the complexi¬ 
fied dynamical variables is modified by this extra term. However, this modification is shown 
not to affect the Fokker-Planck equation for the corresponding complex weight as long as 
observables are restricted to gauge invariant ones. Thus we demonstrate explicitly that the 
gauge cooling can be used as a viable technique to satisfy the convergence conditions for the 
complex Langevin method. We also discuss the “gauge cooling” in 0-dimensional systems 
such as vector models or matrix models. 
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1 Introduction 


Monte Carlo calculation has been playing an important role in nonperturbative studies of 
quantum field theories. However, the usefulness becomes quite limited when the action S 
becomes complex because the integrand e -S in the path integral can no longer be regarded as 
the Boltzmann weight. This occurs in many interesting cases such as QCD at finite density 
or with a theta term, gauge theories with a Chern-Simons term, chiral gauge theories, 
and so on. It also occurs in supersymmetric gauge theories and matrix models relevant to 
nonperturbative studies of superstring theory. 

Amongst various approaches to this complex action problem, the one based on the 
complex Langevin equation has been attracting a lot of attention recently. The original 
idea was proposed by Parisi [T] and Klauder [2] in 1983, and since then it was applied to 
various systems with complex actions. A salient feature of the method is that it works 
beautifully in some fairly nontrivial cases, but it fails completely in the other cases. For a 
long time, theoretical understanding of this feature was missing, and that led to the gradual 
decline of interest in this approach. However, in 2011 one of the problems of the method in 
the case it fails has been clearly identified [Sill]. The authors first derived the key relation 
between the complex Langevin process and the Fokker-Planck equation for the complex 
weight. Then it was found that the integration by parts used in the derivation may not 
be justified unless the probability distribution of the complexified dynamical variables is 
suppressed strongly enough when they take large values. 

The gauge cooling has been proposed to cure this problem in the case of gauge theories 
[5j. It has been applied to finite density QCD in the heavy dense limit and shown to work 
in the whole parameter regime in that limit [6] . More recently it has been applied to finite 
density QCD without taking the heavy dense limit, and it is shown to work at least in the 
deconhned phase [7]. These are already quite remarkable since the cases that have been 
studied include a parameter region, which would be hardly accessible by other methods 
such as reweighting. On the other hand, it is also realized in a solvable gauge theory with 
a complex coupling constant that there exists some parameter regime in which the gauge 
cooling cannot cure completely the insufficient fall-off of the probability distribution [8]. 

In fact there is another problem anticipated to occur when one applies the complex 
Langevin method (CLM) to QCD with light quarks at low temperature. This was realized 
in ref. [0] by applying the CLM to the Random Matrix Theory for finite density QCD. It 
turned out that a naive implementation of the method fails as the quark mass is decreased 
(See ref. [10], however.). The reason for this failure was speculated to have something to do 
with the logarithmic singularity in the action due to the fermion determinant mmm- ° n 
the other hand, it was also pointed out HZ] that the problem occurs due to a singular drift 
term, which breaks the requirement of holomorphy in the derivation of the key relation 
between the complex Langevin process and the Fokker-Planck equation for the complex 
weight M. Recently two of the authors (J.N. and S.S.) [T3]J have argued that it is 
actually the integration by parts used in the derivation that is invalidated by the singular 
drift term. According to this understanding, the problem can be avoided if the probability 
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distribution is suppressed strongly enough near the singularity. In a separate paper pH] we 
show that this can also be achieved by the gauge cooling if one chooses appropriately the 
quantity that should be reduced by the cooling procedure. 

While intuitive arguments for justification of the gauge cooling are given in the liter¬ 
ature (See, for instance, section 5 of ref. [15]). a rigorous justification has been missing. 
Actually there is even some suspicion in the community that the procedure may not be 
fully justified. Some of the concerns we have encountered in private communications are: 
1) the gauge cooling uses a complexified gauge symmetry, which is not a symmetry of the 
original system. 2) the noise term in the complex Langevin equation is invariant under the 
original gauge transformation but not under the complexified gauge transformation. 3) the 
quantity one tries to reduce by the complexified gauge transformation is not holomorphic, 
which may spoil the justification of the CLM. In view of this situation, here we provide a 
rigorous justification of the CLM including the gauge cooling procedure. We first show that 
the gauge cooling can be formulated as an extra term in the complex Langevin equation 
involving a gauge transformation parameter, which is chosen appropriately as a function 
of the configuration before cooling. The probability distribution of the complexified dy¬ 
namical variables is modified by this extra term. However, this modification is shown not 
to change the Fokker-Planck equation for the corresponding complex weight as long as the 
observables are restricted to gauge-invariant ones. Thus we conclude that the gauge cooling 
can be used to realize the properties of the probability distribution that are required for its 
relation to the complex weight without affecting the Fokker-Planck equation. 

We also discuss the “gauge cooling” in O-dimensional systems such as vector models 
or matrix models, which is simpler than in lattice gauge theory. Apart from pedagogical 
purposes, we consider that it is useful, for instance, in studying the matrix models relevant 
to superstring theory mm- 

The rest of this paper is organized as follows. In section [2] we briefly review the Langevin 
method starting from the well-established case of real action, and discuss the conditions 
for correct convergence in the case of complex action. In section [3] we discuss the “gauge 
cooling” in O-dimensional systems and provide its justification. In section [4] we discuss 
the application of the CLM to lattice gauge theory. In section [5] we present an explicit 
justification of the gauge cooling in lattice gauge theory. Section [6] is devoted to a summary 
and discussions. 


2 Brief review of the Langevin method 

In this section we briefly review the Langevin method. (As a comprehensive review on 
this subject, we recommend ref. [IB].) Here we consider a system of n real variables x k 
(k = 1, • • • , n) given by the partition function 


Z 


dxe~ s{x) 


\dx k e ~ s <*> , 


( 2 . 1 ) 
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where the action S(x) is a function of x — (aq, ■ ■ ■ ,x n ). We start with the well-established 
case of real action, which is known also under the name of stochastic quantization. Then 
we discuss the case of complex action focusing on the conditions for correct convergence. 


2.1 the case of real action 


When the action S(x) is real, we can use the ordinary Langevin method to study this 
system [193. Introducing a fictitious time t, we consider the f-evolution governed by the 
Langevin equation 


4 r,) W = ~TZT+Vk(t) , 


dxi 


( 2 . 2 ) 


where rjk(t) are probabilistic variables obeying the probability distribution e _ 4 f dt Vk(t) 2 . The 
first term and the second term on the right-hand side of the Langevin equation (12.2j) are 
commonly called the drift term and the noise term, respectively, for historical reasons. 

The probability distribution of x^(t) can be defined as 


where the expectation value 


P(x,t) = (n 5 ( Xfc ~ x k (*) 

k 

) r] is defined by 

J Vr] ■ ■ ■ q~\ S dtr ik{t ) 2 


(2,3) 




f T>r/e ~^ dtTlk ^ 2 
Using this notation, one obtains, for instance, 

= 2 Ski 3(ti —1 2 ) ■ 


(2.4) 


(2.5) 


One can actually show that P(x, t ) satisfies the Fokker-Planck (FP) equation (See section 
12.21 for derivation.) 


dP 

~dt 


d f dS 


dxi. V dx 


+ 


_d_ 

dxi 


P 


which has a time-independent solution 


D 

1 time—indep 


- _ „~S(x) 

z 


(x) — — e 


( 2 , 6 ) 


(2.7) 


Under quite general conditions [18], one can show that the eigenvalues of the differential 
operator acting on P on the right-hand side of (|2.6|) are strictly negative except for the zero 
eigenvalue corresponding to (12.71) . This implies that the probability distribution P(x,t) ap¬ 
proaches (12.7p exponentially. One can therefore obtain a vacuum expectation value (VEV) 
with respect to the partition function (12.ip as 


^0((r)^ / dxOiyX') -Ptime—indep(^) 
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= lim / dx O(x) P(x,t) 
t—>0o J 

i rto+T 

= lim — / dtO[x^\t)) . (2.8) 

T^oo T J tQ \ J 

In the last step, the statistical average over r] is replaced by the time average assuming the 
ergodicity of the stochastic process as is done in usual Monte Carlo methods. 


2.2 the discretized Langevin equation 


When one tries to solve the Langevin equation 
fictitious time t and solve, for instance^ 


numerically, one has to discretize the 


x k\ t + e ) = x k\ t ) + e (^-^ + r lk(t) 4 S J , (2.9) 

where the probabilistic variables %(£) obey the probability distribution e“4 £ St Let 

us rescale them as fj k = \[tr] k so that they obey the probability distribution euL 
and hence, in particular, 


(fjk(ti) =2 8 k i8 tlM . (2.10) 

With this normalization, the discretized Langevin equation ( I2.9j) becomes 

+ e) = Xk } (t) ~ e— + y/efj k {t) . (2.11) 

Below we omit the tilde on rj to simplify the notation. 

With this discretized version, we can derive the FP equation (12.61) in a more elementary 
manner than in the continuum [[18]. Let us consider a test function f(x) and its expectation 
value 


J { x(V) u = j dxf(x)P(x; t) 
at a fictitious time t. The devolution of this quantity is given by 
[/(z w («+ <:))) 


d£_ 

dx k 


—e 


dS 

dx k 


+ > + 0(f2 ' 


( 2 . 12 ) 


4 There are more sophisticated ways for discretization that can be used to reduce the systematic errors 

due to the discretization. See ref. [2D] and references therein. 
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=e / dx 


»fM + U)p( X ;t) + 0^ 


dx k dxk dxl 


—e [ dxf{x)-£- + P + 0(e 2 ) . 


dxk V dxb dxi 


(2.13) 


Here we have used 

i a 2 / 


2 dxkdxi 


= i e (hg_) (»(*),(*))__ = « (g) , (2.14) 


which follows from the fact that the function n 9 / j s evaluated at x — x^(t), which 
depends only on rj( 0), 77 (e), • • •, rj(t — e), but not on 77 (f). Using (j2.12|) . the same quantity 
(I2.13P should be written as 

/(^’(He))) - ^/(x (j?) (f))) = f dxf(x)(p(x;t + e)-P(x;tf) . (2.15) 


Since (12.13ji and (12.15ft should be equal for an arbitrary /(x), one obtains 

r> f f) S' f) \ 

P(x; f + e) — P(x; t ) = e-— ( —-h -— ) P + 0(e 2 ) . 


dx k \dx k dx k 
Thus, in the e —> 0 limit, one obtains (12.61) . 


(2.16) 


2.3 the case of complex action 

Let us apply the same method to the case in which the action S' is a complex-valued 
function of the real variables x k (k — 1, • • • , n). In that case, however, the first term on the 
right-hand side of the Langevin equation (j2.2j) becomes complex, which means that xjg (f) 
becomes complex even if one starts from a real configuration xjg (0) G M. Let us therefore 
complexify the variable^] as x k H» z k = x k + iy k , and solve the complex Langevin equation 


ihW = 


dS 

dz k 


+ Vk(t) 


(2.17) 


where the action S is now considered as a function of the complex variables z k {k = 1 , • • ■ ,n) 
by analytic continuation. It is important for the method that the action S(z) thus obtained 
is a holomorphic function of z k . The probabilistic variables rj k (t) in (12. 1 7p are, in general, 
complex 


Vk(t) = V^\t) +iVk > ( t ) > 


(i), 


(2.18) 


5 In this respect, there is a closely related approach based on the so-called Lefschetz thimble mm, 
which has attracted much attention recently. See refs. [231 fUl [Tal [251 [77] and references therein. 
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. The probability distribu- 


_I f rff r 1 I 1 ,„(!) 

and obey the probability distribution e 4J ljv R^ fe N i k ' 
tion corresponding to (12. 3 p is defined as 


P( x ,y;t) = , 


where the expectation value ( • • • ) v is defined by 


<■■■>,= 


fVy- e ~^ Sdt{ ^ ® a+ *irt W 
/ Vr] e~^ dt {{t)2+ ^ ^ (t)2} 
With this notation, we have, for instance, 

= 2N R 5 k i 5(t i - t 2 ) , 
^ (I) (^))^ = 2W 4; £(*i - * 2 ) , 

vl 1] = 0 . 

In what follows we assume 


(2.19) 


( 2 . 20 ) 


( 2 . 21 ) 


N-r — Ni = 1 


( 2 . 22 ) 


for a reason that becomes clear later. For practical purposes, one should actually use 
Nr — 1, Ni — 0, corresponding to real rjk(t) with the distribution (12. 4ft . to reduce the 
excursion in the imaginary directions 00], which spoils the validity of the method as we 
review below. 

Repeating the analysis given in section 12.21 one can easily show that P(x, y; t) satisfies 
the FP-like equation 


dP 

~dt 



+ N } 


R 


_d_ 

dxi 


P 


d 

dy k 



+ Ni 


() 

dy k 


P . 


(2.23) 


In fact, for observables O(x) that admit holomorphic extension to 0(x + iy), one can show 
under certain conditions that there exists a complex function p(x]t), which satisfies 


J dxdy 0(x + iy)P(x,y;t) 


dx 0(x)p(x ; t) , 


(2.24) 


and obeys the differential equation 

dp d f OS d \ 

dt dx k V dx k dx k ) ^ ’ 


(2.25) 
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which is formally the same as the FP equation 112.61) . Although there is an important 
difference that the action S and p are now complex, the FP equation 112.25p still has a 
time-independent solution 

Ptime-indep(^) = ^ e _S(x) . (2.26) 

The convergence to this solution in the t —y oo limit requires that all the eigenvalues of 
the operator acting on p on the right-hand side of (12.25j) should have strictly negative real 
part except for the zero eigenvalue corresponding to (j2.26j) . While this is not guaranteed in 
general unlike in the real action case, one can argue that the convergence to fl2.26|) should 
occur if the relation (12.24jl holds and the solution to the FP-like equation (j2.23j) converges to 
some function uniquely. Suppose the operator acting on p has an eigenvalue with a positive 
real part. Then the overall magnitude of p increases exponentially with f, and f!2.24p cannot 
be satisfied. Also, suppose the operator acting on p has an eigenvalue with a vanishing real 
part other than the zero eigenvalue corresponding to (12.26[) . Then the asymptotic behavior 
of p depends on the initial condition, and 112.24p cannot be satisfied with P having a unique 
asymptotic behavior. To the best of our knowledge, this argument has been given for the 
first time in ref. [13] with explicit examples. 

Thus, provided that the relation (I2.24p holds and the solution to the FP-like equation 
(I2.23p converges to some function uniquely, we can calculate the VEV with respect to the 
partition function (12.ip as 


/ dxOix^Px ime—indep(•£) 


= lim / dxO(x) p(x;t) 


£—>•00 


= lim / dx dy 0(x + iy) P(x,y,t) 


£—>•00 


= lim (ofx^^t) +iy( v \t) 


£—>•00 

1 pto+T , v 

= lim - / dtO(x^(t)+iy^(t)) . 

T^oo T J to V t 


(2.27) 


In what follows, we review the derivatioiJ®! of the kev relation 112.24p given in refs. m- 
At t = 0, we can choose 


P(x,y,Q) = p(x-,Q)8{y) , (2.28) 

where p(x; 0) > 0 so that 112.24p holds trivially. In order to prove the relation 112.24p at 
arbitrary t > 0, we are going to show that each side of 112.241) can be rewritten as 


J dxdy 0[x + iy) P(x,y 1 t) 


dxdy 0(x + iy ; t ) P(x , y, 0) , 


(2.29) 


6 For an earlier work on this issue, see ref. [251 . 



































J dx 0(x) p{x;t) = J dx 0(x; t) p{x; 0) . (2.30) 

In eq. (12.2911 . we have introduced the time-dependent observables 0(z; t ) defined by solving 

j t O(z;t) = LO(z-,t) , (2.31) 

a as 

dz k dz k 

with the initial condition 

0(z; 0) = O(z) . (2.33) 

Let us recall that we are considering holomorphic observables O(z). One can actually 
show that the time-evolved observables 0(z;t) remain to be holomorphic when S(z) is a 
holomorphic function (3j. The observables 0(x;t) that appear in (I2.30P are obtained by 
setting y — 0 in 0(x + iy\ t ), and they satisfy the differential equation 

^ 0(x-t) = L 0 O(x;t) , 

d dS 
dx k dx k 

Since the right-hand sides of (I2.29[) and (12.30P are equal to each other due to (I2.28p . 
eqs. (12.29P and (I2.30p imply the desired relation (12.24p . 

In order to show (I2.29p . we introduce the function 



_d_ 

dx k 


(2.34) 

(2.35) 



_d_ 

dz k 


(2.32) 


F(t,r) 


dxdy 0(x + iy; r) P(x, y\t — r) , 


(2.36) 


which interpolates each side of (I2.29p with 0 < r < t. Taking the derivative with respect 
to r, we get 

F(t , r) = J dxdy LO(x + iy; r)P(x, y;t — r) — J dxdy 0(x + iy; t)L t P(x, y;t — t) , 

(2.37) 


where L T denotes the operator acting on P on the right-hand side of (I2.23p . The op¬ 
erator L is then defined as an operator satisfying ( Lf,g) = (f,L T g), where (f,g) = 
J f(x,y)g(x,y)dxdy, assuming that / and g are functions that allow integration by parts. 
The explicit form of the operator L can be obtained as 


L 



+ N r 



d 

dx k 



+ Ni 


d 

dyk 


d 

dyk 


(2.38) 
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An important observation here is that when L acts on a holomorphic function f(z) of z k , 
it can be replaced by L since 


Lf{z) 


-Re 

dS 


3S\ AT 8 
dz k ) + R dz k 


Lf(z) 


3 ]__ 

dz k 

df_ 

dz k 



(2.39) 


where we have used (12.221) . This implies that L in the hrst term of (I2.37P can be replaced by 
L, and hence (12.37ft vanishes if one can perform integration by parts. In that case, F(t, r) 
is independent of r, and (12.29ft follows. 

A similar argument can be used to show (12.301) . We dehne 


G(t, t ) 


Jd X 0( X -T) P ( X ;t 


T 


(2.40) 


which interpolates each side of (12.30ft for 0 < r < t. Taking the derivative with respect to 
r, we get 

-^-G(t,r) — j dx L 0 O(x;r)p(x]t — r) — J dx 0 (x;t)Lq p(x;t — r) , (2-41) 

where we have used (12.341) and (12.25ft . Here the integration on the right-hand side involves 
the real directions x k only, so we can perform integration by parts without any problem 
due to the effects of the action, which make p(x] t ) well localized. Thus (12.4ip vanishes, 
and (I2.30P follows. 

On the other hand, the integration by parts that one needs to use to show that (I2.37P 
vanishes involves the imaginary directions y k . It can therefore be justified only if the 
probability distribution P(x,y,t) has a sharp fall-off in the imaginary directions (3jilj. 

Recently it has been pointed out that the integration by parts can be invalidated also 
when the drift term includes a singularity [13]. This issue is relevant, in particular, to 
complex action systems involving fermions such as finite density QCD since the fermion 
determinant gives rise to a singular drift term. The CLM still works if the probability 
distribution P(x, y\ t ) is suppressed strongly enough near the singularity. 


3 “Gauge cooling” in O-dimensional systems 

At the end of the previous section, we discussed two possible problems, which can make 
the CLM give wrong results. The gauge cooling was originally proposed to cure the hrst 
problem [5] in gauge theories. In ref. [Hj we propose that it can be applied also to cure 
the second problem, and demonstrate that it does in the Random Matrix Theory for finite 
density QCD. In this section we consider the “gauge cooling” in 0-dimensional systems such 
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as the Random Matrix Theory and provide an explicit justification. Apart from pedagogical 
purposes, we consider that it is useful, in particular, in matrix models relevant to superstring 
theory mm- Generalization to the lattice gauge theory is straightforward and it is given 
in sections 0] and El 

Let us consider a system of N real variables with a symmetry under 

x'j = gjkX k , (3.1) 

where g is a representation matrix of a Lie group. An infinitesimal transformation is denoted 
as 


SXj tXjkXk • 


(3.2) 


Here A is an element of the Lie algebra, which can be expanded as 

Ajfc ^ ^ ^a(^a )jk 


(3.3) 


in terms of the generators t a of the Lie group under consideration with real coefficients 
A a G M . Upon complexifying the variables x k K > z k = x k + iy k , the symmetry of the action 
and the observables naturally enhances from (13.11) to 


z'j = gjkZ k , (3.4) 

where g is an element of the Lie group that can be obtained by complexifying the original 
Lie group. In particular, an infinitesimal transformation of the complexified symmetry is 
given by 

Szj = iXjkZk ■ (3.5) 

Here A is an element of the Lie algebra for the complexified Lie group, which can be 
expanded as (13. 3 j) but now with complex coefficients X a G C. 

As a simple example, let us consider an 0(77) vector model 

N N 2 

S(x) — a J^(x fc ) 2 + k j J^(x fc ) 2 | (3.6) 

k= 1 fc=l 

with a G C, which is invariant under (13.ip with g G O(IV). An infinitesimal transformation 
is given by (I3.2[) . where A jk is a purely imaginary antisymmetric N x N matrix. Upon 
complexification Xk i —> Zk = Xk + iyk, the action becomes 

N N 2 

s ( z ) = + K{j2{z k ) 2 } , (3.7) 

k= 1 k= 1 

which is invariant under (I3.4[) with g G 0(77, C), namely with g being an N X N com¬ 
plex matrix satisfying g T g = gg T = 1. (The symbol g T here represents the transpose of 
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the matrix g.) An infinitesimal transformation is given by (13.5|) . where \j k is a complex 
antisymmetric N x N matrix. 

The discretized version of the complex Langevin equation (12. 17ji can be written as 

z { k\t + e) = z^\t) - e|^ + s/er] k (t) , (3.8) 

analogously to (12. lip . The probabilistic variables 

Vk(t) = +iVk\t) (3-9) 


is a 


obey the probability distribution e 4Z " 41jv r n, 'k The gauge cooling 

procedure of making a complexified symmetry transformation (I3.4[) between the Langevin 
steps. Thus it amounts to modifying the complex Langevin equation (j3.8[) into 


z [ k\t) = 9kizl v \t) , 

ZkH* + e ) = + 'fiVkit) , (3.10) 

where g is an element of the complexified Lie group chosen appropriately as a function of 
the configuration before cooling. The basic idea is to determine g in such a way that the 
modified Langevin process (13. 10ft does not suffer from the problem of the original Langevin 
process (13.8p . Clearly this idea will have a chance to work only if the degrees of freedom 
in the symmetry transformation have at least the same order of magnitude as those of the 
dynamical system itself. Gauge theories are one such example, but 0-dimensional models 
such as vector models and matrix models would be equally good as demonstrated explicitly 
in the Random Matrix Theory ra- 

For instance, if the excursions in the imaginary directions are problematic in studying 
the model (13. 6 p by the CLM, one can introduce the norm@ 


N ^ N 

Af = ; B») 2 = ~; 5 >* - « 2 . 

fc=l ^ k=1 


(3.11) 


which measures the distance from the real region, and determine the transformation g in 
(13.1 Op in such a way that the norm JV for z^\t) is reduced by the transformation. In 
ref. [3] we propose to combine this norm with another norm to cure also the problem 
caused by a singular drift term. Typically the norm one tries to reduce is invariant under 
transformations in the original Lie group but not under transformations in the complex¬ 
ified Lie group as in the case of (13. lip . The main issue we address below is whether the 
modification of the Langevin process by the “gauge cooling” spoils the equivalence to the 
path integral reviewed in section 12.31 

7 This is analogous to the so-called “unitarity norm” [5] proposed in the complex Langevin simulation 
of lattice gauge theory. 
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Note that the gauge cooling is a completely deterministic procedure. In particular, the 
transformation g in (13. lOj) is determined only by the configuration z^(t) before cooling. 
Therefore, for our purpose, it is convenient to rewrite (13.101) into the form 

z k\t + e ) = 9ki (zi V \t) ~ + y/erik(t) , (3.12) 

which involves z ^ (£) onlyjf] Let us assume that g is given by 

g = exp {zeA(x (T?) (£),t/ (?7) (f))} , (3.13) 


where A (x,y) is an element of the Lie algebra of 0(N, C). For instance, one may use 

A ki(x,y) = a(x,y)(x k yi-y k xi) = ^ia(x,y)(z k z*-zlzi) , (3.14) 

which can be obtained by calculating the gradient of the norm (13. Ill) with respect to the 
0(N,<C) transformation. The real positive function a(x,y) can be chosen to optimize 
the reduction of the norm. (In practice, one can simply determine g by using the steepest 
descent method with respect to the norm (13.lip [5].) Note that A(x, y) is not a holomorphic 
function of Zk in general as in (13.141) . 

Using (13. 13j) in eq. (13. 12ft and taking the e —> 0 limit, we obtain the continuum complex 
Langeviu equation for z^(t) as 

z { k\t) = + %(t) + i\ k i(x {ri \t),y {ri \t)^ z[ v \t) , (3.15) 


where the effect of the gauge cooling is represented by the last term on the right-hand side. 
Then we can easily find that the FP-like equation (I2.23P that P(x,y, t ) satisfies is modified 
by the gauge cooling as 


dP 

~dt 


d 


dx k 


/ dS 

Re -- i\u(x,y)zi ) + N R 


_d_ 

dx k 


d 

dyk 


\dz, 

T f ds x \ Ar d 

Im I — - i\ki{x, y)zi I + iVi — 


P . 


(3.16) 


This modifies the differential operator L in eq. (I2.38j) into 

r f „ f dS . . A Ar d 1 d 

\ {Wy ,Mx - v)zi ) +Nr fol] d7 t 

+ { _Im (S" !AH(l ’ !,)2:, ) +JV, ^}lL (3 ' 17) 

8 In practice, one usually measures observables using the configuration z^ v \t) after cooling instead of 
z^ v \t). This does not cause any problem since z^ v \t) and z^ v \t) are related to each other by the complex¬ 
ified symmetry transformation (USD, under which the observables are invariant. 
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Acting this modified L on a holomorphic function f(z), we obtain 


/ 8 \ 8 
Lf{z ) = ( y)z l + (iV R - Ni)— ) ^—/(*) 

V dzk J oz k 

= Lf(z ) + y)zr^-f(z) , (3.18) 

OZk 

where L is defined by (12.321) . The extra term compared with (12.391) represents the change of 
f(z) under an infinitesimal O (N, C) transformation. Note that the time-evolved observables 
0(z; t ) defined by (I2.3ip remain to be invariant under the 0(JV, C) transformation as long as 
the action S(z) and the original observables O(z) are invariant. Therefore, the L in the first 
term of (I2.37|) can be replaced by the modified L. Hence (12.371) vanishes if one can perform 
integration by parts for the modified L and P. In that case, the crucial identity (12.241) holds 
for the modified P with the same p. Thus we have shown explicitly that the “gauge cooling” 
provides a possibility to improve the property of the probability distribution P(x, y; t ) so 
that (12.241) holds, without affecting the FP equation (I2.25j) for p(x;t). 


4 Application of the CLM to lattice gauge theory 

In this section we discuss the application of the CLM to lattice gauge theory, which is 
defined by the partition function 


Z= j dUe~ s{u) 


JdU n ,e~ s W , 


(4.1) 


where the action S' is a complex-valued function of the configuration U = {U nfl } composed of 
link variables U nfl G SU(3), and the integration measure dU nfl represents the Haar measure 
for the SU(3) group. The only complication compared with the case discussed in the 
previous sections comes from the fact that the dynamical variables take values on a group 
manifold. The Langevin equation in such a case with a real action is discussed intensively 
in refs. [2911301 EH EH |33]. Using this formulation, we can easily generalize our discussions 
to the case of lattice gauge theory. 

When the action S is complex, the drift term in the Langevin equation makes the 
link variables evolve into SL(3,C) matrices (i.e., 3x3 general complex matrices with the 
determinant one) even if one starts from a configuration of SU (3) matrices. Let us therefore 
complexify the link variables as U nfJ , G SL(3,C), and solve the complex Langevin equation 

Wfi’w = ( - V ^S(U) + >)„„„(()) , (4.2) 


where the action S(U) is now considered as a holomorphic function of the complexified 
configuration U nil , and t a are the generators of the SU(3) group normalized by tr (t a h) = dab- 
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The probabilistic variables r] anfl (t ) are defined similarly to (13.9j) . The derivative operator 
V an V is defined a^| 


V = -(V^ - iV^ ) 

^any, g ^ any) J 

(4.3) 


5 

21=0 

(4.4) 

V‘Zf(U) = df { e-*‘U n „) 

y =o 

(4.5) 

Here f(U) are functions on the complexified group manifold, which are 
holomorphic. and x and v in eqs. (14.41) and (I4.5j) are real parameters, 
holomorphic function f{U ), we have V anil f(U) = 0, where 

not necessarily 
Note that for a 

= 5 Ah + Ah 

, 

(4.6) 

and hence 



= *W(«) . ■ 

(4.7) 

Then we define the probability distribution 



p(u\t)={ n«(« w Ajw)), 

(4.8) 


nfi 


where the delta function is defined by 

J dUf(U) d(u nil .U ni )j = m ( 4 . 9 ) 

for any function f(U). The integration measure that appears on the left-hand side represents 
the Haar measure for the SL(3, C) group normalized appropriately. One can show that the 
probability distribution P(U ; t ) obeys the FP-like equation (See Appendix [A] for derivation.) 

=®£i {Re(lW?(H)) + iV B C™ } P + X>£„ {im(c„„S(M)) + JV.B®,} P . 

( 4 . 10 ) 

In fact, for observables 0(U ) that admit holomorphic extension to 0(U), one can show 
under certain conditions that there exists a complex function p(U;t), which satisfies 

J dU OilA) P(U] t) — j dU 0(U) p(U] t) , ( 4 . 11 ) 

9 The derivative operators defined in eqs. m and m may be regarded as analogues of ^ = ^(Jy ~ 
*^) and m = s(& +*!;)> respectively. 
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and obeys the FP equation 

—p(U]t) = D an/l (^D anil S{U) + D an4 )j p(U]t ) . (4.12) 

Here we have dehned the derivative operator D anil , which acts on a function f{U) of the 
unitary gauge configuration as 

(4.13) 

2 = 0 

Note that the FP equation (14.12j) has a time-independent solution 

Ptime—indep (U') = 6Xp ( - S(U)j . (4.14) 

As we argued in section 12.31 the convergence to (14.141) should occur if the relation (14.lip 
holds and the FP-like equation (14.10(1 converges to some function uniquely. In that case, 
we can calculate the VEV with respect to the partition function (14.ip as 


D an ,f(U) = ^/(r u 'H',,,) 


(0)= / dUO(U) Ptime . indep (U) 


= lim / dUO(U)p(U]t) 
t— >oo J 

= lim / dUO(U)P(U-t) 

t —^oo J 

=l 

i rto+T 

= T lto-) . (4.15) 

Let us briefly discuss how one can derive the relation (14.lip . At t = 0, we choose 


W;Q)= [ dUp(u-,o)l[5(u nil ,u n , 

nfi 


(4.16) 


with p(U ; 0) > 0 so that (14.111) holds trivially. In order to prove the relation (14.lip at 
arbitrary t > 0, we are going to show that each side of (14.111) can be rewritten as 


dU 0(U) P(U■ t)= dU 0{U ; t) P(U ; 0) , 


(4.17) 


dU 0(U) p(U;t) — dUO(U\t)p(U:0) . 


(4.18) 


In eq. (14.17h . we have introduced the time-dependent observables 0(U: t) dehned by solving 


do(lf\t) — LO(U; t) 


(4.19) 


16 






















L= (V an ^-V anil S{U))V, 


(4.20) 


an fa 

with the initial condition 


0(U- 0) = 0(U) . (4.21) 

Let us recall that we are considering holomorphic observables 0(U). One can actually 
show that the time-evolved observables 0(U\t) remain to be holomorphic when S(U) is a 
holomorphic function [3], The observables 0(U;t ) that appears in 04.181) are obtained by 
setting U = U in 0(U\ t ), and they satisfy the differential equation 

jO(U-t) = UO(U-t) , (4.22) 

L 0 = ^ D anil — D anil S{U)^jD antl . (4.23) 

Since the right-hand sides of 04.171) and 04.181) are equal to each other due to (I4.16p . 
eqs. 04.171) and 04.181) imply the desired relation 04.111) . 

In order to show (I4.17p . we introduce the function 

F(t, t) = J dU OiU ; r) P{U\ t - r) , (4.24) 

which interpolates each side of (I4.17p with 0 < r < t. Taking the derivative with respect 
to r, we get 

J^F(t, t) = J dU LO(U ; r) P(U; t - r) - J dU 0(U ; r)L T P(W; t - r) , (4.25) 

where L T denotes the operator acting on P on the right-hand side of (14. lOH . The operator L 
is then defined as an operator satisfying ( Lf,g) = ( f,L T g ), where (f,g) = f f(U)g(U)dU, 
assuming that / and g are functions that allow integration by parts. The explicit form of 
the operator L can be obtained as 

L = {-Re(l>„„„S(W)) + Wr®™ } P™ + {-Im(p a „„S(M)) + N{D^) D®, . (4.26) 

An important observation here is that when L acts on a holomorphic function f(U), it can 
be replaced by L since 

LfiU) = {-Re(£> an/1 S(W)) + N R V anfI } V an J(U) 

+ |-lu.('P fm// 5(W)) + iN{D an ^ iV an J(U) 

= {-V an ^S{U) + (N r - lV : )P anM } V anfl f(U) 

=Lf(U) , (4.27) 
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where we have used (14.71) and (I2.22p . This implies that L in the first term of (j4.25j) can be 
replaced by L, and hence (14. 25 jl vanishes if one can perform integration by parts. In that 
case, F(t, r) is independent of r, and (14.171) follows. 

A similar argument can be used to show (14.18!) . We define 

G(t, t) = J dU OiU ; r) p(U-1 - r) , (4.28) 

which interpolates each side of (I4.18|) for 0 < r < t. Taking the derivative with respect to 
r, we get 

A G(t,r)= I dU L 0 O(U ; r) p(U ; £ — t) — J dU 0(U;r)L^p{U't - r) , (4.29) 

where we have used (14.221) and (I4.12p . Here the integration on the right-hand side involves 
the real directions only, which are compact in the present case, so we can perform integration 
by parts without any problem to show that (I4.29P vanishes. Thus G{t , r) is independent of 
r, and (14.181) follows. 

On the other hand, the integration by parts in (14.251) is justified only if the probability 
distribution P{U\ t) has a sharp fall-off in the noncompact imaginary directions. The gauge 
cooling |5] was originally proposed to solve this problem. As we mentioned in section 12.31 
the integration by parts can be invalidated also when the drift term includes a singularity 
[T3] . This problem is anticipated to occur when one applies the CLM to finite density QCD 
at low temperature with light quarks. The CLM still works if the probability distribution 
P(U;t) is suppressed strongly enough near the singularity. We consider that the gauge 
cooling is useful also in solving this problem as in the case of Random Matrix Theory [14] . 

5 Gauge cooling in lattice gauge theory 

The lattice gauge theory is invariant under the SU(3) gauge transformation. For instance, 
the plaquette action 

•Splaquette ( U) /3 EE tr ) (5*l) 

n 

is invariant under 

Un^L ~ 9nUn^g n+ (x ? (^*2) 

where g n G SU(3). An infinitesimal transformation is denoted as 

U n ^ L \ n -\-jj^j . (5.3) 

Here X n is an element of the Lie algebra, which can be expanded as 

E ^na (ta)jk (5.4) 

a 
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in terms of the generators t a of SU(3) with real coefficients \ na G M . 

When one complexifies the variables U nil H y U. n]l e SL(3,C), the symmetry of the 
action and the observables naturally enhances to the SL(3, C) gauge symmetry that can be 
obtained by complexifying the original Lie group. For instance, the plaquette action (15.11) 
becomes 


(5.5) 


which is invariant under 


Kp = fhMn^nU ( 5 ‘ 6 ) 

with g n e SL(3,C). An infinitesimal transformation is given by 

• (5.7) 

Here A is an element of the Lie algebra for the complexified Lie group, which can be 
expanded as fl5.4j) but now with complex coefficients X na e C. 

The discretized version of the complex Langevin equation (14. 2 J) can be written as 

Unfit + e ) = exp{i^ ( - eV anfl S{U ) + v^7«m M (*)) . (5.8) 

a 

The gauge cooling [5] modifies the complex Langevin equation (15. 8 j) into 
U^(t) = gnU^(t) g ~l ll , 

W'j'ft + ') = expjij] ( - eV am S(U) + ~/fr hn/ Jt)j t a Ju^(t) , (5.9) 

a 

where g n is an element of the complexified Lie group. The basic idea is to determine g n in 

such a way that the modified Langevin process (15.91) does not suffer from the problem of 

the original Langevin process (15.8|) . 

For instance, if the excursions in the imaginary directions are problematic, one can 
introduce a positive semi-definite quantity [33] (We call it the “norm” in this paper.) 

M = 5> («♦„«„„- 1) . (5.10) 

n[i 

which measures the distance from the unitary region, and determine the transformation 
g n in (15. 9 p in such a way that the norm A f for U( v \t) is reduced by the transformation. 
Typically the norm one tries to reduce is invariant under transformations in the original Lie 
group but not under transformations in the complexified Lie group as in the case of (15.10[) . 
Below we demonstrate that the modification of the Langevin process by the gauge cooling 
does not spoil the equivalence to the path integral reviewed in the previous section. 
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Note that the gauge cooling is a completely deterministic procedure. In particular, the 
transformation g n in (15. 9 p is determined only by the configuration U^ v \t) before cooling. 
Therefore, for our purpose, it is more convenient to rewrite (15. 9 p into the form 

(* + e) = 9n exp Y ( - eV anfl S(U)t a + y/ena n ^t)g~H a g^j } (t)c/"+ A , (5.11) 

a 

which involves (£) onlvf^l Let us assume that g n is given as 

g n = exp jzeA n (w (??) (f)}} , (5.12) 

where A n (U) is an element of the Lie algebra of SL(iV, C). For instance, one may use 

\ n (U) = ia{U) Y {[UnMnp - ~ (trace part)} , (5.13) 

which can be obtained by calculating the gradient of the norm (I5.10P with respect to the 
SL(3,C) gauge transformation of the configuration U. The real positive function a(U), 
which is not necessarily holomorphic, can be chosen to optimize the reduction of the norm. 
(In practice, one can simply determine g n by using the steepest descent method with respect 
to the norm (15. lOf) (5j .) Note that X n (U) is not a holomorphic function of U nfl in general as 
in (EH. 

Using (15.12)1 in eq. (15. lip and taking the e —>■ 0 limit, we obtain the continuum complex 
Langevin equation for U^ v \t) as 

=i£ ( - Von „S(U) + r h nn(t)) t a U^(t) 

a 

+ t{ K («'” (t)) u<$ (t) - l4j> ( t ) X n+ , («(') (())}, (5.14) 

where the effect of the gauge cooling is represented by the last term on the right-hand side. 
Then we can easily find that the FP-like equation (14.10p that P(U;t) satisfies is modified 
by the gauge cooling as 



"t" } V an ,S(U) - C an ,) + N{D^} P , (5.15) 

= tr {*« (An(W) - U np \ nVll {U) U~l) } • (5.16) 

This modifies the differential operator L in eq. (14.26P into 

L = {-Re(p a „,S(M) - Cnnn) + JV R Ug.} 

10 In practice, one usually measures observables using the configuration U^(t) after cooling instead 
of U^ v \t). This does not cause any problem since and U^(t) are related to each other by the 

complexified symmetry transformation m , under which the observables are invariant. 
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(5.17) 


+ {-Im(©„„S(M) - C„„„) + N{D ,£>„} pro„. 

Acting this modified L on a holomorphic function f(U), we obtain 

Lf{U) = {-V anfl S(U ) + C an , + ( N r - Ni)V an/1 } V an ,f(U ) 

= Lf(U)+C anfl V an J(U) , (5.18) 

where L is defined by (I4.20p . The extra term compared with (j4.27|) represents the change 
of f(U) under an infinitesimal SL(3, C) gauge transformation. Note that the time-evolved 
observables 0(U;t) defined by (14.191) remain to be invariant under the SL(3,C) gauge 
transformation as long as the action S(U) and the original observables 0(U) are invariant. 
Therefore, the L in the first term of (I4.25P can be replaced by the modified L. Hence 
(14.251) vanishes if one can perform integration by parts for the modified L and P. In that 
case, the crucial identity (14. lip holds for the modified P with the same p. Thus we have 
shown explicitly that the gauge cooling provides a possibility to improve the property of 
the probability distribution P{U\ t ) so that (14. lip holds, without affecting the FP equation 
®T2D for p(U-t). 

6 Summary and discussions 

In this paper we have provided a rigorous justification of the CLM with the gauge cooling 
procedure. As we have reviewed in detail, the CLM relies crucially on the relation between 
the probability distribution P associated with the complex La.ngevin process and the com¬ 
plex weight p associated with the original path integral problem. This relation holds if and 
only if the probability distribution P satisfies the following two properties. One is that 
it has a rapid fall-off in the imaginary directions in the complexified configuration space 
M. The other is that the distribution is such that complexified configurations which 
make the drift term singular are strongly suppressed [13]. Since the gauge cooling modifies 
the probability distribution P, one may hope to make it satisfy the above two properties by 
appropriately choosing the complexified symmetry transformation to be used in the cooling 
procedure. What we have shown in this paper is that the modification of the probability 
distribution P due to the gauge cooling does not alter the FP equation that the complex 
weight p obeys if the relation between P and p holds at all. 

For a long time it has been thought that the convergence of the FP equation for the 
complex weight p is not guaranteed unlike in the real action case. However, once the 
relation between the probability distribution P and the complex weight p is established, 
one may argue [13] that the unique convergence of the probability distribution P already 
implies the unique convergence of p to the desired complex weight e~ s . Therefore, if one 
can satisfy the above two properties of the probability distribution P by using the gauge 
cooling appropriately, the CLM is guaranteed to give the correct results. 

While the gauge cooling certainly enlarges the range of applicability of the CLM, it 
remains to be seen how powerful it is in studying various interesting systems with complex 
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actions. In this regard, our results for the Random Matrix Theory using the gauge cooling 
with a new type of norm S3! look very promising. 
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A Derivation of eq. ( 14.101) 


In this appendix, we derive the Fokker-Planck like equation 114.10)1 to make this paper self- 
contained. Here we deal with the continuous t for simplicity, but one can make a similar 
analysis with discretized t as we have done in section 12.21 

Let us consider a test function f{U) and its expectation value 


(/(H W W)) = f dUf(U)P(U-,t) . 


Taking the derivative with respect to the fictitious time t, we get 

The left-hand side can be evaluated as follows. 

4 (/(«<’>(*)) )__=({- Re(».,S(« w (f))) + O*)} Z>S./(W W M )) v 

T ^ ^ L11 (Van/i S(M<”'(«))) +>£!„(<)}P®,/(z/yi)))^ . 

Here we use the following formula. (See ref. [IS], for instance.) 


(s(w (,,, ( ( ))’?£w( i )) t| = 



Using (1A.4|1 and (1A.5|) in (1A.3I) . we get 


I('( mW <*>)>, 


(A.l) 

(A.2) 


(A.3) 

(A.4) 

(A.5) 
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^Re yD an ^ S(Uf">(t))) VjVf(uM(t)))" + (A rRP™Z>™f(U M (t) 

^im {r> an ^ 

= j dU P(U-t)[- Re (z> ow S(M)) !)«/(«) + AT„I>W pM /(W ) 

- Imfc aw S(M)')©2L/(M) + M ■D®V®J(U) 


= j Mf(U)[vW {Re(p a „,S(M)) +JV b P™}p 

+ X>2> {imfp^S(M)') + P 


(A.6) 


Plugging this expression in (1A.2I) . and using the fact that (1A.2j) should hold for an arbitrary 
f(U), we get eq. (14.10p . 
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